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We show that an economic system populated by multiple agents generates an equilibrium distribu- 
tion in the form of multiple scaling laws of conditional PDFs, which are sufficient for characterizing 
the probability distribution. The existence of the double scaling law is demonstrated empirically 
for the sales and the labor of one million Japanese firms. Theoretical study of the scaling laws 
suggests lognormal joint distributions of sales and labor and a scaling law for labor productivity, 
both of which are confirmed empirically. This framework offers characterization of the equilibrium 
distribution with a small number of scaling indices, which determine macroscopic quantities, thus 
setting the stage for an equivalence with statistical physics, bridging micro- and macro-economics. 



PACS numbers: 02.50.-r, 05.70.-a, 89.65.-s, 89.75. Da 

Economics is in crisis. Although there exists a main- 
stream approach PQ H] , its internal coherence and ability 
in explaining empirical evidences are increasingly ques- 
tioned. The causes of the present state of affairs go back 
to the mid of the XVIII century, when new figures of so- 
cial scientist {economists) borrowed the method [math- 
ematics) of the most successful hard science [physics) 
allowing for the mutation of political economy into eco- 
nomics. It was, and still is, the Newtonian mechanical 
physics of the XVII century, which rule economics. 

From then on, economics lived its own evolution based 
on the classical physics assumptions (reductionism, de- 
terminism and mechanicism) . Quite remarkably, Keynes 
adopted the approach of statistical physics, which deeply 
affected physical science at the turn of the XIX century 
by emphasizing the difference between micro and macro, 
around the mid 1930s [3]. However, after decades of ex- 
traordinary success it was rejected by the neoclassical 
school around the mid 1970s, which framed the discipline 
into the old approach and ignored, by definition, any 
interdependence among economic agents (firms, banks, 
households) and difference between microscopic individ- 
ual behavior and macroscopic aggregate behavior. 

The ideas of natural laws and equilibrium were 
transplanted into economics sic et simpliciter. As a 
consequence of the adoption of the classical mechan- 
ics paradigm, behavior of macro-economic systems are 
treated as a scaled- up version of one individual agent, 
who is called Representative Agent (RA), and complex- 
ity that emerges from aggregation was lost. Any learned 
physicist knows that this is entirely wrong for physi- 
cal systems with many constituents: macroscopic behav- 



ior of gas is qualitatively different from that of a sin- 
gle molecule. Likewise, economy of a country is not ex- 
plained by analysing a single RA as if he is on a deserted 
island all by himself, like Robinson Crusoe without even 
Friday, and multiplying the number of population to the 
results. 

Another quite a dramatic example is the concept of 
equilibrium. In many economic models equilibrium is 
described as a state in which (individual and aggregate) 
demand equals supply. The notion of statistical equi- 
librium, in which the aggregate equilibrium is compati- 
ble with stochastic behavior of the constituents, is out- 
side the box of tools of mainstream economists. Again, 
physics teaches us that the equilibrium of a system does 
not require that every single element be in equilibrium 
by itself, but rather that the statistical distributions de- 
scribing macroscopic aggregate phenomena be stable. 

What modern physics can do for economics is, then, 
to open a way to a proper treatise of macro economy as 
an aggregation of individual economic agents, which is 
one main theme of econophysics on real economy jH [5] . 
Such a thought is not totally unfamiliar to open-minded 
economists, whose keyword is Heterogeneous Interact- 
ing Agents (HIA) [6], where 'heterogeneity' implies that 
each has different characteristics; different financial pro- 
file like different energy and momentum, and 'interaction' 
is trade with exchange of money, goods, workers, infor- 
mation, etc. just as physical particles interact with each 
other. 

In this letter we will show that a system populated 
by many HIA generates equilibrium distribution in the 
form of scaling laws. In particular, economic literature 
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has shown the existence of large and persistent differ- 
ences in labour productivity across industries and coun- 
tries [THS] . Productivity is often measured in terms of the 
ratio between firms' revenues and the number of employ- 
ees, ft can be expected to be a unique value only within 
a very straight jacket hypothesis, such as the Represen- 
tative Agent. If agents are heterogeneous and interact, 
then scaling laws emerge, and dispersion is nothing but 
a consequence of it. Using Japanese data we empirically 
demonstrate it. The conclusive remarks points out that 
a thermodynamical approach (see also [TU1 E] ) may be 
what economics needs if HIA are the actors of the drama. 

For the purpose to study properties in probability dis- 
tributions, it is essential to observe a large portion of 
the entire population of firms and workers. A database 
of only listed firms, for example, is insufficient to ana- 
lyze properties of distributions. We employ the largest 
database of Credit Risk Database (CRD) in Japan (years 
1995 to 2009), which includes a million firms and fifteen 
million workers in the year 2006, covering the large por- 
tion of the whole domestic population. Below we give our 
analysis for the year 2006, but note that the qualitative 
results are valid for other years as well. 

We measure the value added Y and the labor L of each 
firm to have the information of output and input in the 
production at the individual level. We use simply the 
business sales/profits as a proximity to the value added, 
and the end-of-year number of workers (excluding man- 
agers) as the labor in order to calculate distributions for 

Y and L and to uncover their properties. 

To understand how workers are distributed among dif- 
ferent levels of output and productivity, we shall study 
the distributions of Y and L using the following prob- 
ability density functions (PDFs). The joint PDF, Pyl, 
the conditional PDFs, Py\L an d Pl\y, an d the marginal 
PDFs, Pl and P Y , are defined by 

Pyl(Y,L) = P YlL (Y\L)P L (L) = P LlY (L\Y)P Y (Y). (I) 

The conditional average of f(Y) is defined by 

E(f(Y)\L) := / f(Y)P YlL (Y\L)dY, (2) 
Jo 

for an arbitrary function /(•), and similarly for 
E(f(L)\Y). 

As we shall see, since the PDFs are heavy-tailed for 

Y and L, it is convenient for the purpose of statistical 
analysis to take the logarithms of variables: 
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(3) 



where lo and Lq are arbitrary scales. 

Fig. [I] shows the scatter plot for (£,y). To reveal sta- 
tistical structure in data, which can be easily missed by 
parametric methods, we employ a kernel-based nonpara- 
metric methods [12] . Fig. [T] depicts a nonparametric re- 
gression curve with error bars (95% significance level) for 



- 1 — i — i — i — i — i — i — i — i — i — I — i — i — i — i — i — i — i — i — r 




FIG. 1. Scatter plot for (L, Y) (gray dots). The curve 
with error bars is the nonparametric estimation for E(y\£) := 
E(lnY\L), while the thin straight line is the linear regression 
(both axes are in units of In 10.) 



E(y\£) = E(hxY\L). We can observe that there exists a 
range 10 07 < L < 10 2 5 for which the relation: 



E(y\£) = a£ + const., 



(4) 



holds where a is a constant. In fact, the goodness of fit for 
nonparametric regression (R 2 = 44.04%; see [T3] for the 
definition) has a same level as that for linear regression 
{R 2 = 44.03%) for the range, the estimation of which 
gives the estimation, a = 1.037(±0.003) (shown by a 
straight line in Fig. [l~j . 

Similarly, for the range of 10 4 5 < Y < 10 70 , we have 
another relation, namely 



E(£\y) = f3 y + const. 



(5) 



with a constant /3. The validity for this relation is 
checked by the nonparametric (R 2 — 47.18%) and lin- 
ear (R 2 = 47.09%) regressions, the latter of which gives 
the estimation, /3 = 0.655(±0.002). 

We find that these relations are simple consequences 
from two scaling relations for the conditional PDFs, 
P Y \ L {Y\L) and P L]Y (L\Y). Fig. ffl (a) depicts the con- 
ditional PDF, Py\l{Y\L), with the conditioning values 
of L are chosen at a logarithmically equal interval cor- 
responding to the range 10 07 < L < 10 20 in terms of 
histograms. By using the values of a estimated above, 
we find that the conditional PDF obeys a scaling relation: 



Py\l(Y\L) 



L 
Lo 



scaled J 



(6) 



where Scaled := (L/Lq) a Y and $y(-) is a scaling func- 
tion, as shown by the fact that the PDFs P Y ^(Y\L) 
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FIG. 2. (a) The conditional PDF Py\l(Y\L) for L G [5, 200], where the conditioning values of L are chosen at a logarithmically 
equal interval, (b) The scaled conditional PDF $y (Scaled) defined by Eq. |6J. Dots are the scaled data points and the curve 
is the lognormal function given by Eq. (161. 




FIG. 3. The scaled conditional PDF $ .ensealed) defined by 
Eq. 0. The curve is the lognormal function given by Eq. |T8| ) . 



for different values of L fall onto a curve depicted in 
Fig. [2] (b). It is straightforward to show that Eq. Q 
follows from Eq. 

Similarly, we have another scaling relation for 



Pl\y(L\Y) = 



^L^scaled) j 



(7) 



where i sca icd := (Y/Yq)~^L and is a scaling func- 

tion, as shown by Fig. [3] And also Eq. |5| immediately 
follows from Eq. ([7]). 

The two scaling laws, Eqs. (JsJ) and ([7]), which we col- 
lectively call the double scaling law (DSL) have strong 



consequences to the function form of the joint PDF. Let 
us briefly describe them in the following. 

Let us choose the reference scales Y"o an d Lq to be 
within the region of the (Y, i)-plane where DSL is valid. 
Then by substituting Y = Y , L = L into Eqs. @, ^ 
and Q, We obtain the marginal PDFs, Py and Pl in 
terms of the invariant functions, <&y and $l: 



Pl(L) = 



Py(Y) = 



L \ a $ L (£) <S> Y (Y ) 
L Q J <f> L {L )<Z> Y ({L/L )-°'Y ) 

Y ) <S> Y (Y Q )<f> L ((Y/Y )-PL ) 



Pl(L ) 



(8) 



Py(Y ). 



(9) 



From Eqs. (|6| and Q and the above, we arrive at the 
following equation for the $s: 

* L ((Y/Y )-f>L) <S> L (L ) = <j> y ((L/L )- a Y) <P Y (Y Q ) 

(10) 

This equation puts strong constraints the form of $'s. 
We have converted the above to differential equations 



and have derived complete solutions of Eq. (10l.[14j De- 
pending on whether a/3 = 1 or not, the solution is quali- 
tatively different, which we shall explain below. 

When a/3 = 1, we find the following relation between 



$'s is necessary and sufficient for Eq. ( 10 ) 



ML) = *y((L/L )-«Y )(±YP^. ,11) 



In other words, we have one arbitrary function in the 
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solution. In this case, Eqs. ([8| and ^ implies that 

-fj-L— l 



Pl(L) 
Py(Y) 



Pl(L ), 



with 



a = — , p = . 

Py PL 



Py(Y ), 



Hy 



(12) 
(13) 

(14) 



This result is straightforward to understand: Due to 
aB = 1, we have 



^scaled YL a X Scaled- 



(15) 



Therefore, an arbitrary function of Kjcaicd is a function 
of Scaled as f ar a s dependence on Y and L is concerned. 
This is why an arbitrary function is left in the solution. 



Also the marginal PDFs in Eqs. (12 1 and (13 1 results 
from the relation Q. 
For a/3 ^ 1, we obtain, 



$ L (L) = e- a ^ 2+ ^$ L (L ), 

P L (L) = e -«(i-^ 2 +( S +( 9 +i)o)£ Fi ( io)i 

P Y (Y) = e~W- Q ' 3 )ra i '+(<i+(s+i)P)v p Y (Y Q ) . 
The joint PDF is given by the following: 



(16) 
(17) 
(18) 
(19) 



Pyl(Y,L) = e- a P e + 2a P pey -PPv 2+se+,lv P YL (Yo,Lo). 

(20) 

We find that in the limit a/3 -4 1 we obtain the power 
laws for Pl(L) and Py(Y), which is consistent with the 



results (12 1 and (13) above. 



The parameter of p is estimated in two ways: The 
best fit of the theoretical expression ( 16 1 in Fig. [2] (b) 
yields p = 0.692(±0.027), while Ec~|l6]) in Fig. [| 
yields p = 0.704(±0.016). These measured values of p 
agree with each other very well, with combined value 
p = 0.698(±0.025), assuming equal weight and no corre- 
lation. The marginal PDF P L (L) and Py(Y) in Eqs. (fT8l) 



and ( 19 ) agrees with empirical data very well with these 



values of p. These analysis show that our theoretical re- 
sults above are in good agreement with data. 

We stress that the above results are proven in the lo- 
cal region of the (Y, L)-plane where DSL is valid. On 
the other hand, if the lognormal PDF (20) is valid every- 



where on the (Y, L) plane, one may obtain the marginal 
PDFs and $s as given in Eq. (fi7}-([l9]), as Pl{L) can be 
obtained by integrating Eq. (20) over Y g [0,oo), and 
then obtain $ L {L) in Eq. (|17[) from Eqs. ^ and Q, 



and so forth, which constitute easy checks of the relation 
between various functions. 



Let us now study the labor productivity C :— Y/L in 
case of aB ^ 1, as such is the reality as shown empiri- 
cally. The joint PDF of (C, L), P C l(C, L) is given by the 
following: 



Pcl(C,L) = LP yl (CL,L). 



(21) 



Substituting the expression ( 20 ) into the above, we find 



that is express, we obtain that Pcl(C,L) too is of log- 
normal form like the r.h.s. of (20) with a,B,p replaced 
by 



1, f3:=B 



a 



1 



a + 8-2a8' 



, a + 8- 2a8 
P ■= ; P, 



a 



1 



(22) 
(23) 



respectively. By substituting empirical values found for 
a, 8 and p (the average of the two central values of p 
found above), we find that they are a — 0.037(±0.003), 
P = 0.072(±0.007), p = 6.353(±0.680). By comparing 
this with Eq. (19), we find the following marginal PDF 



for the productivity C: 



P C {C) oc e'^ lC + T 2C , 
aP{\-aB) 



7i 



a + P - 2a8 



p. 



(24) 
(25) 



where c := ln(C/Co) with Cq := Yq/Lq and the mea- 
sured values of a,B,p yield 71 = 0.456(±0.017). Also, 
the conditional PDF P L \ C (C,L) satisfies the scaling law; 



L\C 



(C,L) 



' C 
Co 



L 



$i oc e 



-/3pc +qc 



(26) 
(27) 



while the average of the labor L for a given productivity 
C is given by the following; 



E[L\C] 



C_ 
C 



(28) 



Fig. [4] checks the result of Eq. (28) with the parameter 
P estimated from the relation in Eq. (27), where the thick 



straight line is the theoretical calculation. Fig. [5] depicts 
the scaling function ^E'(-), where the curve is given by 
the fitting in Eq. ( 27 ) . Both of these results confirm our 



results in the region where the scaling relations are valid. 

In this paper, we have shown that the Japanese data 
for some one million firms show that the firm distribu- 
tion in (Y, L) plane satisfy the double scaling law (DSL). 
We have shown that DSL leads to either power-law for 
the marginal PDF of Y and L, or the lognormal PDF 
for the joint PDF of Y and L. Although we have con- 
centrated on these specific two variables because of their 
importance for the labour productivity, we believe that 
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FIG. 4. Scatter plot for (C,L) (gray dots). The curve 
with error bars is the nonparametric estimation for E(£\c) := 
E(\n L\CY while the thick straigh t lin e is the theoretical pre- 
diction with P given by Eq. (Eil (in units of In 10). 




FIG. 5. The scaled conditional PDF ^(Lscaied) defined by 
Eq. (27 1. The curve is the lognormal function given in the 



same equation. 



other financial quantities obey DSL. Furthermore, joint 
PDF of more than two variables are expected to obey 
the similar scaling laws, which yield extensions of the re- 
sults explained above, including multi-variate lognormal 
distributions, yielding simple relations such as Eq 
between scaling exponents 



(25) 



These laws offer straightfor- 
ward guidance to the method of aggregation: the macro- 
scopic quantities are now expressed in terms of just a few 
parameters in the PDFs, conditioned by DSL. Thus scal- 



ing laws and the resulting lognormal distributions should 
be the basic ingredient of economics of HIA, which is an 
equivalence of the statistical physics, bridging micro and 
macro economy. 
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